Function Approximation for Solving Stackelberg Equilibrium in Large Perfect Information Games

نویسندگان

چکیده

Function approximation (FA) has been a critical component in solving large zero-sum games. Yet, little attention given towards FA general-sum extensive-form games, despite them being widely regarded as computationally more challenging than their fully competitive or cooperative counterparts. A key challenge is that for many equilibria no simple analogue to the state value function used Markov Decision Processes and games exists. In this paper, we propose learning Enforceable Payoff Frontier (EPF)---a generalization of We approximate optimal Stackelberg correlated equilibrium by representing EPFs with neural networks training using appropriate backup operations loss functions. This first method applies setting, allowing us scale much larger while still enjoying performance guarantees based on error. Additionally, our proposed incentive compatibility easy evaluate without having depend self-play best-response oracles.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rationality and equilibrium in perfect-information games

In generic perfect-information games the unique Subgame-Perfect Equilibrium (SPE) outcome is identical to the one predicted by several rationalizability notions, like Extensive-Form Rationalizability (EFR), the Backward Dominance Procedure (BDP), and Extensive-Form Rationalizability of the Agent form (AEFR). We show that, in contrast, within the general class of perfect information games all th...

متن کامل

Abstraction for Solving Large Incomplete-Information Games

ion for Solving Large Incomplete-Information Games Tuomas Sandholm Computer Science Department Carnegie Mellon University

متن کامل

Solving Stackelberg games with uncertain observability

Recent applications of game theory in security domains use algorithms to solve a Stackelberg model, in which one player (the leader) first commits to a mixed strategy and then the other player (the follower) observes that strategy and bestresponds to it. However, in real-world applications, it is hard to determine whether the follower is actually able to observe the leader’s mixed strategy befo...

متن کامل

Endgame Solving in Large Imperfect-Information Games

The leading approach for computing strong game-theoretic strategies in large imperfect-information games is to first solve an abstracted version of the game offline, then perform a table lookup during game play. We consider a modification to this approach where we solve the portion of the game that we have actually reached in real time to a greater degree of accuracy than in the initial computa...

متن کامل

Solving Large Imperfect Information Games Using CFR+

Counterfactual Regret Minimization and variants (e.g. Public Chance Sampling CFR and Pure CFR) have been known as the best approaches for creating approximate Nash equilibrium solutions for imperfect information games such as poker. This paper introduces CFR, a new algorithm that typically outperforms the previously known algorithms by an order of magnitude or more in terms of computation time ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i5.25715